Building a Domain-Specific French-Korean Lexicon

Author

  • Ae-sun Yoon
Abstract

The Korean government has adopted the French TGV as its high-speed transportation system, and the first service is scheduled for the end of 2003. TGV-related documentation is voluminous, and more than 76% of it has been translated into English. A large part of the English version is, however, incomprehensible without reference to the original French version. The goal of this paper is to demonstrate how DiET 2.5, a lexicon builder, makes it easy to build a domain-specific terminology lexicon that may contain multimedia and multilingual data with multi-layered logical information. We believe our work represents an important step in enlarging the language scope and advancing the development of electronic lexica, and in providing the flexibility to define any type of DTD and interconnectivity among collaborators. As an application of DiET 2.5, we would like to build a TGV-related lexicon in the near future.

Similar resources

Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System

We describe in this paper a hybrid approach to build automatically bilingual lexicons of Multiword Expressions (MWEs) from parallel corpora. We more specifically investigate the impact of using a domain-specific bilingual lexicon of MWEs on domain adaptation of an Example-Based Machine Translation (EBMT) system. We conducted experiments on the English-French language pair and two kinds of texts...

Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology

This paper reports on the development of a French FrameNet, within the ASFALDA project. While the first phase of the project focused on the development of a French set of frames and corresponding lexicon (Candito et al., 2014), this paper concentrates on the subsequent corpus annotation phase, which focused on four notional domains (commercial transactions, cognitive stances, causality and verb...

A Topic Model for Building Fine-grained Domain-specific Emotion Lexicon

Emotion lexicons play a crucial role in sentiment analysis and opinion mining. In this paper, we propose a novel Emotion-aware LDA (EaLDA) model to build a domain-specific lexicon for predefined emotions that include anger, disgust, fear, joy, sadness, surprise. The model uses a minimal set of domain-independent seed words as prior knowledge to discover a domain-specific lexicon, learning a fine-...

Building a Maritime Domain Lexicon: a Few Considerations on the Database Structure and the Semantic Coding

In this paper we report on our ongoing construction of a specialized lexicon for the maritime domain, together with the coding performed according to the ItalWordNet semantic relations model. The main characteristics of the lexical semantic database and the specific features of the specialized language are taken into consideration and, in particular, we concentrate our attent...

Building a computational lexicon of verbal syntactic constructions in French

We describe a computational lexicon which encodes verbal syntactic constructions in French. The aim of the project is to encode into a standard formalism the valency lexicon for French initiated in the eighties by M. Salkoff and A. Valli and continued since then. The whole computational lexicon will include about six thousand verbal entries with a list of the arguments of each verb, the restric...

Publication date: 2002